
[SharkInference-SharkRuntime] Adds capability to mmap vmfbs #1540

Merged · 1 commit into nod-ai:main · Jun 22, 2023

Conversation

Abhishek-Varma
Contributor

-- This commit is based on the VmModule.mmap() API (iree-org/iree#14124).
-- It thereby adds the capability to mmap vmfbs in SHARK.

Signed-off-by: Abhishek Varma [email protected]
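
For context, a minimal sketch of the two loading paths this enables (the file name and setup are illustrative, not SHARK's actual code):

import iree.runtime as ireert

instance = ireert.VmInstance()

# Previous path: read the whole .vmfb into memory and hand the bytes to the VM.
with open("model.vmfb", "rb") as f:  # "model.vmfb" is a placeholder path
    buffer_module = ireert.VmModule.from_flatbuffer(instance, f.read())

# New path: memory-map the file so pages are loaded lazily and the
# module data can be shared instead of copied.
mmapped_module = ireert.VmModule.mmap(instance, "model.vmfb")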

@Abhishek-Varma
Contributor Author

So, tempfile.NamedTemporaryFile needs delete=False set for this to work on Windows.

Also, on Windows I see an API-related issue in VmModule.mmap at this line. The error is:

AttributeError: module 'mmap' has no attribute 'MAP_SHARED'
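
For reference, a minimal sketch of the Windows-friendly temp-file pattern (it assumes instance and flatbuffer_blob already exist; the names are illustrative, not the exact SHARK code):

import tempfile

import iree.runtime as ireert

# Assumes `instance` (an ireert.VmInstance) and `flatbuffer_blob`
# (compiled vmfb bytes) already exist; names are illustrative.
tmpf = tempfile.NamedTemporaryFile(suffix=".vmfb", delete=False)
tmpf.write(flatbuffer_blob)
tmpf.close()  # on Windows the file must be closed before it can be re-opened
vm_module = ireert.VmModule.mmap(instance, tmpf.name)
# delete=False means the temp file must be removed explicitly once it is
# safe to do so (see the unlinking discussion below).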

@powderluv
Contributor

This has landed again. iree-org/iree#14153

    mmaped_vmfb = ireert.VmModule.mmap(instance, flatbuffer_blob_or_path)
    context = ireert.VmContext(instance, modules=[hal_module, mmaped_vmfb])
else:
    tmpf = tempfile.NamedTemporaryFile(delete=False)
Contributor

this should be automatic now.

@powderluv
Contributor

Let's land this tomorrow so we can have mmap support.

@Abhishek-Varma
Contributor Author

Oh! I missed this - sure, I'll take a look at the upstream changes, update this PR accordingly, test, and mark it ready for review.

@Abhishek-Varma
Contributor Author

So, I'm trying to see how to unlink the mapped file: the callback leads to an error because, by the time it is triggered, the temporary file's name/data is already lost.
I'll test on Windows too after that - I see my setup was deleted.

Worst case, I'll confine mmap loading to the cases where the vmfb is actually generated and saved on disk, which removes the need for a temporary file altogether.

I've also switched the from_flatbuffer API usage over to the from_buffer API, which takes care of the warning that was being set off.

Will update here on my progress.
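
A minimal sketch of the "unlink only temp files" approach described above (helper name and arguments are illustrative, not SHARK's actual code):

import os
import sys
import tempfile

import iree.runtime as ireert

def load_vmfb_module(instance, flatbuffer_blob_or_path, is_path):
    if is_path:
        # A vmfb that was generated and saved on disk: never unlink it,
        # otherwise the artifact itself would be deleted.
        return ireert.VmModule.mmap(instance, flatbuffer_blob_or_path)

    # In-memory flatbuffer: spill to a temporary file so it can be mmapped.
    tmpf = tempfile.NamedTemporaryFile(suffix=".vmfb", delete=False)
    tmpf.write(flatbuffer_blob_or_path)
    tmpf.close()
    vm_module = ireert.VmModule.mmap(instance, tmpf.name)

    if sys.platform != "win32":
        # On POSIX the mapping stays valid after unlink, so the temp file
        # can be removed immediately; on Windows leave cleanup for later.
        os.unlink(tmpf.name)
    return vm_module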

@powderluv

@Abhishek-Varma
Contributor Author

I initially verified the loading on CUDA.
I then shifted to CPU (because the CUDA VM got preempted) and continued there - I've kept the unlinking limited to temporary files only, but based on a few comments upstream I've commented those lines out. Apart from the test script I was using, I also verified that StableDiffusion's vmfb gets loaded properly.
On CPU I also verified that execution takes the load_module path, where I do not perform unlinking - otherwise the generated vmfb itself would get deleted!

On Windows I tested with my script, which compiles a basic module and exercises the different paths SHARK's compilation can take (with mmap switched ON and OFF) - see the sketch below.
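
A minimal standalone check along those lines, exercising both loading paths directly with the IREE Python bindings (the module, backend, and driver here are illustrative and not the actual test script):

import tempfile

import numpy as np
import iree.compiler as ireec
import iree.runtime as ireert

MLIR = """
func.func @add(%a: tensor<4xf32>, %b: tensor<4xf32>) -> tensor<4xf32> {
  %0 = arith.addf %a, %b : tensor<4xf32>
  return %0 : tensor<4xf32>
}
"""

def run(use_mmap):
    config = ireert.Config("local-task")
    vmfb = ireec.compile_str(MLIR, target_backends=["llvm-cpu"])
    if use_mmap:
        # mmap path: spill the flatbuffer to a file and map it.
        tmpf = tempfile.NamedTemporaryFile(suffix=".vmfb", delete=False)
        tmpf.write(vmfb)
        tmpf.close()
        vm_module = ireert.VmModule.mmap(config.vm_instance, tmpf.name)
    else:
        # buffer path: hand the in-memory flatbuffer to the VM.
        vm_module = ireert.VmModule.from_flatbuffer(config.vm_instance, vmfb)
    ctx = ireert.SystemContext(vm_modules=[vm_module], config=config)
    a = np.arange(4, dtype=np.float32)
    b = np.ones(4, dtype=np.float32)
    return ctx.modules.module["add"](a, b).to_host()

for use_mmap in (False, True):
    print("mmap" if use_mmap else "buffer", run(use_mmap))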

After a while I switched over to CUDA and tried re-installing shark.venv using setup_venv.sh in my branch.
It gave the following error:

Looking in links: https://llvm.github.io/torch-mlir/package-index/, https://nod-ai.github.io/SHARK-Runtime/pip-release-links.html, https://download.pytorch.org/whl/nightly/torch/
Obtaining file:///home/abhishek/my-shark
  Installing build dependencies ... error
  error: subprocess-exited-with-error
  
  × pip subprocess to install build dependencies did not run successfully.
  │ exit code: 1
  ╰─> [39 lines of output]
      Looking in links: https://llvm.github.io/torch-mlir/package-index/, https://nod-ai.github.io/SHARK-Runtime/pip-release-links.html, https://download.pytorch.org/whl/nightly/torch/
      Collecting setuptools>=42
        Using cached setuptools-68.0.0-py3-none-any.whl (804 kB)
      Collecting wheel
        Using cached wheel-0.40.0-py3-none-any.whl (64 kB)
      Collecting packaging
        Using cached packaging-23.1-py3-none-any.whl (48 kB)
      Collecting numpy>=1.22.4
        Using cached numpy-1.25.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (17.6 MB)
      Collecting torch-mlir>=20221021.633
        Using cached torch_mlir-20221213.686-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (221.7 MB)
      Collecting iree-compiler>=20221022.190
        Using cached iree_compiler-20230524.529-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (55.8 MB)
      Collecting iree-runtime>=20221022.190
        Using cached iree_runtime-20230524.529-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (2.6 MB)
      INFO: pip is looking at multiple versions of torch-mlir to determine which version is compatible with other requirements. This could take a while.
      Collecting torch-mlir>=20221021.633
        Using cached torch_mlir-20221212.685-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (221.7 MB)
        Using cached torch_mlir-20221211.684-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (221.6 MB)
        Using cached torch_mlir-20221210.683-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (221.6 MB)
        Using cached torch_mlir-20221209.682-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (221.6 MB)
        Using cached torch_mlir-20221208.681-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (221.6 MB)
        Using cached torch_mlir-20221206.71-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (219.6 MB)
      ERROR: Cannot install torch-mlir==20221206.71, torch-mlir==20221208.681, torch-mlir==20221209.682, torch-mlir==20221210.683, torch-mlir==20221211.684, torch-mlir==20221212.685 and torch-mlir==20221213.686 because these package versions have conflicting dependencies.
      
      The conflict is caused by:
          torch-mlir 20221213.686 depends on torch==2.0.0.dev20221211
          torch-mlir 20221212.685 depends on torch==2.0.0.dev20221211
          torch-mlir 20221211.684 depends on torch==1.14.0.dev20221205
          torch-mlir 20221210.683 depends on torch==1.14.0.dev20221205
          torch-mlir 20221209.682 depends on torch==1.14.0.dev20221205
          torch-mlir 20221208.681 depends on torch==1.14.0.dev20221205
          torch-mlir 20221206.71 depends on torch==1.14.0.dev20221122
      
      To fix this you could try to:
      1. loosen the range of package versions you've specified
      2. remove package versions to allow pip attempt to solve the dependency conflict
      
      ERROR: ResolutionImpossible: for help visit https://pip.pypa.io/en/latest/topics/dependency-resolution/#dealing-with-dependency-conflicts
      [end of output]
  
  note: This error originates from a subprocess, and is likely not a problem with pip.
error: subprocess-exited-with-error

× pip subprocess to install build dependencies did not run successfully.
│ exit code: 1
╰─> See above for output.

The iree-compiler version is also quite old (20230524.529 instead of 20230620.434) when I check with pip list on my Linux CUDA VM.

I speculate that's why CI is failing as well. @monorimet

@powderluv

@powderluv
Contributor

You're on Python 3.10; using 3.11 will fix it.

Collaborator

@monorimet left a comment


LGTM! Thanks for this.

@Abhishek-Varma merged commit cdd505e into nod-ai:main on Jun 22, 2023
6 checks passed